Exploiting Natural Language Structures in Software Informal Documentation

نویسندگان

چکیده

Communication means, such as issue trackers, mailing lists, Q&A forums, and app reviews, are premier means of collaboration among developers, between developers end-users. Analyzing sources information is crucial to build recommenders for example suggesting experts, re-documenting source code, or transforming user feedback in maintenance evolution strategies developers. To ease this analysis, previous work we proposed Development Emails Content Analyzer (DECA), a tool based on Natural Language Parsing that classifies with high precision development emails' fragments according their purpose. However, DECA has be trained through manual tagging relevant patterns, which often effort-intensive, error-prone requires specific expertise natural language parsing. In paper, first show, an empirical study, the extent producing rules identifying patterns effort, depending nature complexity patterns. Then, propose approach, named Nlp-based softwarE dOcumentation aNalyzer (NEON), automatically mines rules, minimizing effort. We assess performances NEON analysis classification mobile discussions, issues. simplifies identification definition processes, allowing savings more than 70 percent time otherwise spent performing activities manually. Results also show NEON-generated close manually identified ones, achieving comparable recall.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hypertext Structures and Software Documentation

Software documentation represents a critical resource to the successful functioning of many enterprises. However, because it is static, documentation often fails to meet the needs of the many diverse users who are required to consult it on a regular basis in the course of their daily work. Software documentation is a rich resource that has not been fully exploited. Treatment of software documen...

متن کامل

Applying Hypertext Structures to Software Documentation

Software documentation represents a critical resource to the successful functioning of many enterprises. However, because it is static, documentation often fails to meet the needs of the many diverse users who are required to consult it on a re gular basis in the course of their daily work. Software documentation is a rich resource that has not been fully exploited. Treatment of software docume...

متن کامل

Natural language processing for documentation analysis

In view of the increasing interest in ontologies as a source of world knowledge, this deliverable presents different types of ontologies and describes the approach adopted within the Klase project towards the problem of mapping specialized linguistic ontologies to generic resources. It reports on investigations related to the possibility of applying linguistic ontologies to the problem of inter...

متن کامل

Natural Language in Software Engineering

The large time and effort devoted to software maintenance can be reduced by providing software engineers with software tools that automate tedious, error-prone tasks. However, despite the prevalence of tools such as IDEs, which automatically provide program information and automated support to the developer, there is considerable room for improvement in the existing software tools. The authors’...

متن کامل

Exploiting Parse Structures for Native Language Identification

Attempts to profile authors according to their characteristics extracted from textual data, including native language, have drawn attention in recent years, via various machine learning approaches utilising mostly lexical features. Drawing on the idea of contrastive analysis, which postulates that syntactic errors in a text are to some extent influenced by the native language of an author, this...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Software Engineering

سال: 2021

ISSN: ['0098-5589', '1939-3520', '2326-3881']

DOI: https://doi.org/10.1109/tse.2019.2930519